Universal Access To All Knowledge
Home Wayback Machine | Archive-It | Blog | Heritrix
Search: Advanced Search
Anonymous User (login or join us) Upload

Most Downloaded Items
Last Week more

  1. Webwide Crawldata 2011-03-09T00:28:53PST to 2011-03-11T13:42:25PST
    13 downloads
  2. Webwide Crawldata 2011-03-13T18:04:52PDT to 2011-03-13T18:17:45PDT
    10 downloads
  3. Webwide Crawldata 2011-03-20T00:45:19PDT to 2011-03-19T23:59:45PDT
    10 downloads
  4. Webwide Crawldata 2011-04-10T23:07:53PDT to 2011-04-13T17:04:46PDT
    10 downloads
  5. Webwide Crawldata 2011-04-14T13:58:15PDT to 2011-04-14T10:28:36PDT
    10 downloads

Most Downloaded Items more

  1. Liveweb Capture 2011-06-04T21:50:17PDT to 2011-06-04T20:29:29PDT
    302 downloads
  2. Webwide Crawldata 2011-05-01T03:04:05PDT to 2011-05-01T00:35:28PDT
    293 downloads
  3. Liveweb Capture 2011-06-05T09:32:50PDT to 2011-06-05T09:19:46PDT
    285 downloads
  4. Webwide Crawldata 2011-03-09T00:28:53PST to 2011-03-11T13:42:25PST
    284 downloads
  5. Webwide Crawldata 2011-03-11T21:42:25PST to 2011-03-12T01:22:38PST
    284 downloads

Spotlight Item

Liveweb Capture 2011-06-04T21:50:17PDT to 2011-06-04T20:29:29PDT
Internet Archive Liveweb Capture from WaybackMachine, captured by wwwb-proxy0.us.archive.org:wbm from Sat Jun 4 21:50:17 PDT 2011 to Sat Jun 4 20:29:29 PDT 2011.

About the Internet Archive

Background

Frequently Asked Questions

144,843 itemsWelcome to Internet Archive Web Crawls

Crawl data collected by the Internet Archive. This data is currently not publicly accessible in this format. To view archived web pages, please visit the Wayback Machine.


Browse by Subject / Keywords

All items (most recently added first) - RSS

Sub-Collections

2004 Election
2004 Election crawl performed by Internet Archive. This data is currently not publicly accessible.
2004 Indian Ocean earthquake and tsunami
Data related to the 2004 Indian Ocean earthquake and tsunami collected by Internet Archive. This data is currently not publicly accessible.
Amazon
Amazon.com data collected by Internet Archive. This data is currently not publicly accessible.
COM Survey Crawls
Survey crawls of .com domains. This data is currently not publicly accessible.
2,680 items
Crawl Data
Crawl Data. This data is currently not publicly accessible.
7,747 items
Edu & Gov Crawl, June 2010
TEST COLLECTION: Crawl of .edu and .gov sites started in June 2010.
1 items
FS Fed US
Data collected in 2005 by Internet Archive. This data is currently not publicly accessible.
Fundacao para a Computacao Cientifica Nacional
NASA-related data collected by Internet Archive. This data is currently not publicly accessible.
Geocities Closing Crawl
Geocities crawl performed by Internet Archive. This data is currently not publicly accessible.
31 items
Google Video
Content crawled from video.google.com prior to shut down.
7,171 items
Hurricane Katrina
Data related to Hurricane Katrina collected in 2005 by Internet Archive. This data is currently not publicly accessible.
Inktomi 2001
Data collected in 2001. This data is currently not publicly accessible.
International News Crawls
Crawls of International News Sites
1,156 items
Live Web Proxy Crawls
Content crawled via the Wayback Machine Live Proxy.
2,472 items
Mandriva Crawl
Mandriva.com crawl performed by Internet Archive. This data is currently not publicly accessible.
National Oceanic and Atmospheric Administration (NOAA)
Demo crawl for National Oceanic and Atmospheric Administration (NOAA). This data is currently not publicly accessible.
National Science Digital Library
Demo crawl for the National Science Digital Library. This data is currently not publicly accessible.
Net & Gov Crawl
Net and Gov crawl performed by Internet Archive. This data is currently not publicly accessible.
NET Survey Crawls
Survey crawls of .net domains. This data is currently not publicly accessible.
500 items
Nigerian Election
Data related to Nigerian elections, 2001 collected by Internet Archive. This data is currently not publicly accessible.
NL TV
Data collected in 2005. This data is currently not publicly accessible.
Open Sky
Demo crawl of scientific data. This data is currently not publicly accessible.
ORG Survey Crawls
Survey of .org domains. This data is currently not publicly accessible.
212 items
September 11th
Data related to September 11th, 2001 collected by Internet Archive. This data is currently not publicly accessible.
Standards
Standards crawl data collected by Internet Archive. This data is currently not publicly accessible.
To Crawl
Data collected by Internet Archive. This data is currently not publicly accessible.
Top 150 Crawl
Top 150 Alexa sites crawl performed by Internet Archive. This data is currently not publicly accessible.
UK Government Site Crawl
Collaborative closure crawl of British government sites performed by Internet Archive. This data is currently not publicly accessible.
University of Michigan
Data collected by Internet Archive on behalf of University of Michigan. This data is currently not publicly accessible.
VOX.com Crawl September 2010
Crawl of vox.com, September 2010. This was an attempt to preserve vox.com content as much as possible in the wake of service closure, September 30, 2010.
28 items
Wayback Indexes
Wayback indexes. This data is currently not publicly accessible.
Wayback Robots Crawl
Wayback robots.txt crawl performed by Internet Archive. This data is currently not publicly accessible.
Wide Crawls
Wide crawls of the Internet conducted by Internet Archive. Access to content is restricted. Please visit the Wayback Machine to explore archived web sites.
85,614 items
Wiki Crawl
Wiki data collected by Internet Archive between 2006-2008. This data is currently not publicly accessible.
Wikipedia Outlinks
Crawl of outlinks from wikipedia.org. These files are currently not publicly accessible.
4,286 items
World Wars Crawl
Web data related to World Wars I and II collected by Internet Archive in an experimental crawl sponsored by National Endowment for the Humanities and JISC. This data is currently not publicly...
Yahoo! Video Crawl
Pages captured from Yahoo! Video prior to removal of user uploads. Crawl Started February 2011. This data is currently not publicly accessible.
4,658 items
Youtube Crawl
Crawl data from youtube.com. These files are currently not publicly accessible.
28,268 items

Recently Reviewed Items (more)

This Just In (more)

YouTube Video Crawldata 2012-06-02T19:43:20PDT to 2012-06-02T13:51:01PDT
9 minutes ago

Webwide Crawldata 2012-06-02T18:59:57PDT to 2012-06-02T13:58:20PDT
12 minutes ago

Webwide Crawldata 2012-06-02T19:02:23PDT to 2012-06-02T13:52:32PDT
16 minutes ago

Webwide Crawldata 2012-06-02T18:13:13PDT to 2012-06-02T13:40:43PDT
19 minutes ago

Webwide Crawldata 2012-06-02T19:03:52PDT to 2012-06-02T13:27:06PDT
20 minutes ago


  

Terms of Use (10 Mar 2001)